Statistical sound source identification in a real acoustic environment for robust speech recognition using a microphone array

نویسندگان

Takanobu Nishiura

Satoshi Nakamura

Kiyohiro Shikano

چکیده

It is very important for a hands-free speech interface to capture distant talking speech with high quality. A microphone array is an ideal candidate for this purpose. However, this approach requires localizing the target talker. Conventional talker localization methods in multiple sound source environments not only have difficulty localizing the multiple sound sources accurately, but also have difficulty localizing the target talker among known multiple sound source positions. To cope with these problems, we propose a new talker localization method consisting of two algorithms. One algorithm is for multiple sound source localization based on CSP (Cross-power Spectrum Phase) analysis. The other algorithm is for sound source identification among localized multiple sound sources towards talker localization. In this paper, we particularly focus on the latter statistical sound source identification among localized multiple sound sources with statistical speech and environmental sound models based on GMMs (Gaussian Mixture Models) and a microphone array towards talker localization.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Blind Source Separation for Speech Application Under Real Acoustic Environment

A hands-free speech recognition system [1] is essential for the realization of an intuitive, unconstrained, and stress-free human-machine interface, where users can talk naturally because they require no microphone in their hands. In this system, however, since noise and reverberation always degrade speech quality, it is difficult to achieve high recognition performance, compared with the case ...

متن کامل

CMSC 660 Project Solutions Optimization methods for Sound Source Localization using Microphone arrays

Microphone arrays are widely employed for applications like teleconferencing, high quality sound capture, speaker recognition/identification, acoustic surveillance, head aid devices, speech acquisition in automobile environments etc. For all these applications the benefits that a microphone array provides over a single microphone are two fold. First using a microphone array we can localize a so...

متن کامل

Speech Recognition of Double Talk using S

Double-talk recognition under a distant microphone condition, a serious problem in speech applications in a real environment, is realized through use of modified SAFIA and acoustic model adaptation or training. The original SAFIA is a high-performance audio segregation method based on band selection using two directivity microphones. We have modified SAFIA by adopting array signal processing an...

متن کامل

Robust cross-correlation-based methods for sound-source localization and separation using a large-aperture microphone array

of “Robust cross-correlation-based methods for sound-source localization and separation using a large-aperture microphone array,” by Hoang T. Do, Ph.D., Brown University, May 2011 Microphone arrays have been used in many applications, such as: teleconferencing, speech recognition, talker characterization, speech enhancement, source localization and separation, etc. Despite the fast-paced develo...

متن کامل

Acoustic Source Localization Based on Beamforming

Sound sources can be localized and analysed using phased microphone arrays.Beamforming is a method for processing microphone array data to produce images that represent the distribution of the acoustic source strength.It is an imaging technique that applies to continuous or discrete source distribution.A microphone array can be designed to be more sensitive to the sound coming from one or more ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2001

Statistical sound source identification in a real acoustic environment for robust speech recognition using a microphone array

نویسندگان

چکیده

منابع مشابه

Blind Source Separation for Speech Application Under Real Acoustic Environment

CMSC 660 Project Solutions Optimization methods for Sound Source Localization using Microphone arrays

Speech Recognition of Double Talk using S

Robust cross-correlation-based methods for sound-source localization and separation using a large-aperture microphone array

Acoustic Source Localization Based on Beamforming

عنوان ژورنال:

اشتراک گذاری